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We derive a version of the adiabatic tlieorem tliat is especially suited for applications in adiabatic quantum 
computation, where it is reasonable to assume that the adiabatic interpolation between the initial and final 
Hamiltonians is controllable. Assuming that the Hamiltonian is analytic in a finite strip around the real time 
axis, that some number of its time-derivatives vanish at the initial and final times, and that the target adiabatic 
eigenstate is non-degenerate and separated by a gap from the rest of the spectrum, we show that one can obtain an 
eiTor between the final adiabatic eigenstate and the actual time-evolved state which is exponentially small in the 
evolution time, where this time itself scales as the square of the norm of the time-derivative of the Hamiltonian, 
divided by the cube of the minimal gap. 



I. INTRODUCTION 



The adiabatic approximation, with its long history [1,2], has played a central role in quantum mechanics. This approximation 
states, roughly, that for a system initially prepared in an eigenstate (e.g., the ground state) |$o(0)) of a sufficiently slowly varying 
Hamiltonian h{t), the time evolution governed by the Schrodinger equation = h{t)\4'{t)) will approximately keep the 

actual state of the system in the corresponding instantaneous ground state |$o(i)) of h{t), provided that there are no level 

crossings. Quantitative statements of this approximation have a long history, with rigorous results appearing only in recent years. 
Let \^j{t)) ij e {0, 1, 2, . . .}) denote the instantaneous eigenstate of h{t) with energy ej{t), i.e., h{t)\^j{t)) = ej{t)\^j{t)). 
The simplest and one of the oldest traditional versions of the adiabatic approximation states that the Hamiltonian must be slow 
with respect to the time scale dictated by the ratio of a matrix element of the time-derivative of the Hamiltonian to the square 
of the spectral gap d = mmQ<t<T:j>o[sj{t) — c-o{t)] [3]. Namely, the fidelity between the actual final state and the adiabatic 
eigenstate satisfies |(V'(T)|<i>o(r))P > 1 - e^, where e = ina,xo<t<T:j>o[\{^j{t)\hit)\^o{t))\/(P]- Unfortunately this simple 
criterion — while often useful, and widely used (e.g., in Refs. [4-6]) — is in fact neither necessary nor sufficient in general, 
as recently confirmed experimentally [7]. The inadequacy of the traditional criterion is well known and the criterion has been 
replaced by rigorous general results as can be found, e.g., in Refs. [8-12], and in the presence of noise, in Ref. [13]. All these 
rigorous results are more severe in the gap condition than the traditional criterion, and they involve a power of the nortn of time 
derivatives of the Hamiltonian, rather than a transition matrix element. In this work we revisit the adiabatic approximation and 
prove a version of the adiabatic theorem that is motivated by recent developments in the field of quantum information science 
[14], namely the idea of adiabatic quantum computation (AQC) [4, 5]. 

AQC offers a fascinating paradigm for exploiting quantum mechanics in order to obtain a speedup for classically difficult 
computational problems. In AQC one solves a computational problem by adiabatically modifying a Hamiltonian whose initial 
ground state j^olO)) encodes the input and whose final ground state \^o{T)) encodes the output. The time T taken to reach the 
final ground state is the "running time" of the quantum adiabatic algorithm, which one would like to minimize while at the same 
time minimizing the distance S between the actual final state \tp{T)) and the desired final ground state |<I>o(T)). This, of course, 
is the subject of the quantum adiabatic theorem, and provides the motivation for the present paper In quantum computation one 
is interested in how T scales with problem size, which is typically encoded into the system size needed to represent the problem, 
e.g., the number n of quantum bits ("qubits"). Thus one expects both T and S to scale with n. Our goal is two-fold. First, to 
determine the scaling properties of T and 6 with n. Second, to show that 5 can be made exponentially small in T. 

AQC was first proposed as an approach to solving optimization problems such as satisfiability of Boolean formulas, by 
encoding a cost function into the Hamiltonian [4, 5]. However, it was soon realized that AQC is not limited to optimization: from 
a computational complexity perspective AQC is equivalent in power to all other models for universal quantum computation [15- 
19]. Namely, AQC and the other models for universal quantum computation can simulate one another with at most polynomial 
resource overhead. One of the reasons AQC has generated much interest recently is that it has a rich connection to well studied 
problems in condensed matter physics. For example, because of the dependence of T on the minimal gap, the performance of 
quantum adiabatic algorithms is strongly influenced by the type of quantum phase transition the same system would undergo 
in the thermodynamic limit [20, 21], thus offering an interesting perspective on the connection between quantum information 
methods and problems in condensed matter physics. AQC also appears to offer advantages over classical simulated annealing in 
finding an approximate ground state energy of a complex system [22]. 

To date, most AQC studies have relied on the traditional version of the adiabatic approximation. The main question we wish to 
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address in this work is: what is the rigorous tradeoff between a small final error 5 and the scaling of the final time T with system 
size n, or any other relevant parameter? The reason one expects a dependence on n is clear from the "traditional criterion": 
the gap d and h{t) both depend on n. It has been known for some years in the mathematical physics literature that exponential 
accuracy in the form 6 < e^"^-^ (c is some constant) is possible [9, 1 1], but the cost in terms of physical resources such as system 
size n has thus far not been quantified in rigorous proofs of the adiabatic theorem, even those specifically aimed at AQC [12, 13]. 
However, in AQC an understanding of the scaling of the running time with respect to problem size is of paramount importance. 

It is further important to stress that in AQC one deals with "designer Hamiltonians", which contain controllable parameters 
beyond what is typically assumed in developments of the adiabatic approximation or proofs of the adiabatic theorem. That is, one 
envisions any number of "control knobs" which allow one to realize the goal of transforming the initial to the final Hamiltonian. 
Here we prove a version of the adiabatic theorem which focuses on one such control knob: the number N of vanishing initial and 
final time derivatives of the Hamiltonian. Furthermore, we assume that the Hamiltonian satisfies certain analyticity properties. 
Our theorem states, roughly, that for such Hamiltonians, the deviation between the final state and the desired (non-degenerate) 
adiabatic eigenstate can be made exponentially small in N, with a running time that scales as as a polynomial in N times the 
square of the supremum of the norm of the time derivative of the Hamiltonian, divided by the cube of the minimal gap. Since 
the scaling of supt \\h\\ and of the gap d can be quantified in terms of the system size n for situations of interest in AQC (see 
Section V B), this provides an answer to the scaling question we posed above, and our result should be particularly useful for 
AQC applications. In proving this result we rely on the adiabatic exponential error estimate and asymptotic expansion due to 
Hagedorn and Joye [11]. 

The structure of this paper is the following. We begin by stating our notation, definitions, and technical assumptions in Section 
II, and conclude this section with a statement of our version of the adiabatic theorem, including some remarks. Before proving 
this theorem, we provide pertinent background in Section III, including a brief summary of key results from Ref . [11]. Section IV 
is devoted to the proof of the adiabatic theorem. We provide a discussion in Section V, including a comparison of our version of 
the adiabatic theorem to some of the results of Refs. [12, 23]. In the same section we also analyze the scaling of the running time 
and error with system size, give an explicit result (Corollary 10) for second order quantum phase transitions, find the minimum 
adiabatic time T by optimizing the adiabatic interpolation and give an application of our adiabatic theorem in the open system 
setting (Theorem 4). We conclude in Section VI with some remarks about future directions. The reader who is not interested in 
the (lengthy) details of the proof of our adiabatic theorem can safely skip from the end of Section II to the discussion in Section 
V. 



II. ADIABATIC THEOREM WITH ARBITRARY ACCURACY 

In this section we state our adiabatic theorem. But before doing so, we introduce the requisite notation, definitions, and 
assumptions. 

A. Schrodinger equation in dimensionless units 

Let us start with the time-dependent Schrodinger equation 

ot 

where we work in units of ft = 1. Define the dimensionless Hamiltonian H as 



H = hi J, (2) 

where J is an arbitrary energy unit relative to which we shall express all other dimensional quantities.' Let us fix the final time 
T and define the dimensionless rescaled time r as 

T = t/T = eJt. (3) 
This expresses the fact that for any given T we can always choose J so that e, defined via 

e = 1/{JT), (4) 



Since J is arbitrary it will drop out at the end of the analysis, when we reintroduce dimensional units. 
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is a small number, which we need for the asymptotic expansion below. The Schrodinger equation in dimensionless units now 
reads 

ie^=H{T)i;{T). (5) 
From now on most of our calculations will be done in dimensionless units. 



B. Assumptions concerning the Hamiltonian, target state, and initial state 

We shall need three assumptions. The first sets the stage for the family of Hamiltonians we shall be concerned with in this 
work. A Hamiltonian is by definition a self-adjoint (i.e., Hermitian) operator for real-valued times r, and we shall be concerned 
with analytic continuations of the Hamiltonian. 

Assumption 1 {H{T)}Y{aTe[o.oa) one-parameter family of bounded Hamiltonians of an n-body system, with the separable 
Hilbert space 7i„ = 7i®". ^ Let 7 > denote the distance to the pole or branch point o/{iJ(T)} that is nearest to the real 
T-axis in the complex T-plane. The family {H {t)}h^t-^[q^^^ admits an analytic continuation to an open set = {t : |r| < 
7} U {r : I Im(r)| < 7,Re(r) e [0, 1]} U {r : |t- 1| < 7}, as depicted in Fig. 1? 

Example 1 In linear interpolations of the type H(t) = xq{t)Hq + xi{t)Hi (with Hq and Hi constant Hamiltonians such that 
H{0) — Hq and H{1) — Hi), often used in AQC, if Xq and Xi are real-analytic functions, we can usually perform an analytic 
continuation. The height 7 is dictated by possible singularities that appear because of complexification of the functions xq and 
x\. E.g., xo{t) = (1 — t)/(1 + r^) and xi{t) = 2t/ (1 + r^) (satisfying xo(0) = a;i(l) = 1 and xo{l) = xi{0) = Oj, are 
real-analytic functions of t € M, but have singularities at t = ±i. In this case, analytical continuation is possible within S-y as 
depicted in Fig. 1, with < 7 < 1. 

We denote the spectrum of an operator A by a (A). While our analysis applies equally well to finite and infinite-dimensional 
but bounded systems (dimTY < cx) or = 00, respectively), for most applications the Hamiltonians of interest to us are those that 
are usually considered in quantum information theory, such as spin lattices with exchange interactions, for which dim H < 00 
and a{H{T)) is discrete. Whenever our analysis below can be simplified by this assumption, we will explicitly assume that 
the instantaneous eigensystem associated with the discrete spectrum (t{H{t)) is {Ej{T), I'&j (T))}*lg^. Therefore the spectral 
theorem allows us to write; 

M-l 

H{t)= J2 E,{t)\^,{t)){<^>,{t)\, (6) 
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FIG. 1: Region of analyticity S~f of the time-dependent, analytically continued Hamiltonian H{t). The x symbols denote possible poles or 
branch points of H{t). 



^ A "separable" vector space is one that admits a countable orthonormal basis. 

^ We assume analyticity in the sense of Kato [34][p.l4]. Note that if H{t) is afamily of bounded operators, analyticity in the sense of Kato is equivalent to the 
definition of bounded operator- valued analytic functions. 
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where ($j(r)|$j (r)) = 5ij, and M = dim(7Y„) = (dimTY)". For notational convenience we will often write |<I>o(t)) = 
|$(r)), and Eq{t) = E{t), though the "target" instantaneous eigenstate \^{t)) need not necessarily be the ground state (see 
Fig. 2). The labeling in Eq. (6) is chosen such that it preserves ordering of the eigenvalues at the initial time (eigenvalues are 
continuous for all real t, but not necessarily differentiable). This means that we allow the eigenvalues to cross [except with 
E{t)] (see Assumption 3). 

Our second assumption further constrains the class of Hamiltonians: 

Assumption 2 It is possible to set any given number of the derivatives of the Hamiltonian H{t) to zero at the initial and final 
times. 

This assumption is reasonable for "controllable" Hamiltonians of the type one envisions in AQC. We shall see that it will play 
a key role in constraining the error and in enforcing the right initial condition for AQC. 
Our third assumption concerns the properties of the target state. Let 

dist(X,r)= inf (7) 

for any pair of sets X,Y <Z<C: 

Assumption 3 The "target" state |$(t)), with the corresponding eigenvalue E{t), is a nondegenerate and isolated eigenstate 
ofHir). 

"Isolated" means that the spectrum (j{H{t)) has a gap structure around E — Fig. 2 — with a non-vanishing distance from 
the rest of the spectrum, i.e.: 

Ao(t) = dist {{E{t)},<j{H{t))\{E{t)}) > Vr. (8) 
The minimum dimensionless spectral gap is 

A= inf Ao(r), (9) 

Te[o,i] 

while in dimensional units we denote the minimum spectral gap by 

d = J A. (10) 
We shall often require the following quantity in our analysis: 

( = sup ll^ll, (11) 

where || • || denotes the standard operator norm defined in the next subsection. Note that ^ has units of energy. The projector 
onto the target subspace, and its complement, will also play important roles in our analysis: 

P(r)^|<i>(r))($(r)|, (12) 
Pi(T) = /- |$(t))($(t)|. (13) 

In terms of these projectors, Eq. (6) can be replaced by the more general representation 

H{t) = EiT)P{T) + P^{T)H{T)P^iT), (14) 

which does not depend on the existence of a discrete spectrum. 




FIG. 2: Assumed gap structure in the spectrum of h{r). 
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C. Norms and notation 

We use the standard operator norm defined as the maximum singular value, which reduces for diagonalizable operators X to 
[24]: 

||X|| = sup \{v\X\v)l (15) 
lh"ll=i 

where X : V — ^ V, V = span{|m)} is a linear inner-product space with vectors \v) = J2m ^m l'm), for which 



V 



is the standard Euclidean norm and (m|m') — 6mm'- We shall repeatedly use the submultiplicativity property of the operator 
norm [24] : 

ll^^^ll < (17) 

The minimum gap, the norm of the Hamiltonian and the rate at which the Hamiltonian changes, scale with the number n 
of subsystems. The parameter n is important for AQC because it represents the size of the computational problem. In the 
thermodynamic limit, systems with interesting computational properties undergo a quantum phase transition, i.e., the system 
will become gapless and the norms of the Hamiltonian will diverge. But for every finite n the system is finite and the gap can be 
assumed to be finite. Thus we introduce notation to make explicit that these quantities vary with n: 

For all finite n the quantities 1/A, \\H\\, and \\H\\ are finite; 

1/A = A{n) < oo, (18) 
sup = /3{n) =^{n)/ J <oo, (19) 

re[0.1] 

sup \\H\\ = r]{n) < oo, (20) 

rG[04] 

where dot denotes d/dr. We can always take A, 13,7] > 1. 

D. Adiabatic Theorem 

We are now ready to state our main result. We wish to quantify the error in the adiabatic approximation, i.e., the distance 
between the actual final state \tlj{T)) and the target state |$(T)) (both normalized). Since in quantum mechanics \ip{T)) and 
|$(T)) ai-e rays in the projective Hilbert space VHn [i.e., we identify e''^\ilj{T)) with \tlj{T)) and e'^\^{T)) with |$(T)), 
where x G K are both arbitrary phases], the appropriate distance is D ^ arccos / (the Fubini-Study distance), where / = 
I (■(/'(T) |$(r)) I is the "fidelity". As we shall see later [Eq. (48)], we shall obtain bounds for 

6 ^ IMT)) ^ e^^MT))l (21) 

where % G R is a (dynamical) phase. Writing {■ip(T)\^{T)) = J^e*^, where < { < 1 and e K, we have 
S = ^/2]^^^'T'(X)s{x+'^y\■ Thus as long as cos(x + 9) > 0, an upper bound on 5 is a lower bound on T, and hence an 
upper bound on D. From now on we shall be concerned exclusively with the distance 6. 

Theorem 1 Given assumptions 1-3, and that the first + 1 derivatives of the Hamiltonian vanish at t — and t — 1, a final 
time T which scales as 

T^'-N^L., ill) 

where the "time dilation" q > 1 is a free parameter, yields an adiabatic approximation error which satisfies: 

5<{N + ly+^q-^. (23) 
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Thus, the adiabatic approximation error is exponentially small in the number N of vanishing iJ-derivatives, for a given time 
dilation q. 

In several other papers, e.g., Refs. [9, 11], exponential error estimates in T that behave as S < e""^^ have been proven, 
where c is some system-dependent constant. This applies in the setting where T is treated as an independent parameter and 
one considers a constant (time-dependent) Hamiltonian in a fixed Hilbert space. However, there are some important differences 
between our result and the works cited above. First, neither of Refs. [9, 1 1] extracted the time scale from the small parameter e 
used in their expansions, which means that no direct physical interpretation of these results was given in terms of the adiabatic 
time. As we shall see, e alone does not have a physical meaning. It is the combination of e with the dimensional Hamiltonian 
and gap that will yield the time scale for the magnitude of the error For the same reason, J drops out [see Eq. (112)]. Second, 
in Ref. [11] the error is not with respect to the instantaneous adiabatic eigenstate, but with respect to the superadiabatic basis, 
which has no immediate use for AQC. Let us now recast Theorem 1 in the form of an exponentially small error as a function 
of T, by eliminating N between Eqs. (22) and (23). In doing so we focus on the (simplified) exponential part (q^^), since for 
N ^ 1 the polynomial factor {N + 1)'^^^ will be suppressed in comparison to the exponential factor This gives an expression 
involving g"^/*, which is minimized when q = e. We thus have (after re-inserting the polynomial factor): 

Corollary 1 Under the assumptions of Theorem 1 the adiabatic error satisfies 

S<{cT+ iy+^ e-=^, (24) 

where 

c=^. (25) 

Thus our analysis reproduces the exponential error estimates obtained previously, but makes the dependence on the gap and 
rate of change of the Hamiltonian explicit. 

Finally, in applications to AQC it may be sufficient to have a fixed error, as long as this corresponds to a success probability 
greater than 1/2. It then follows from the Chernoff inequality [25] that one can then boost the confidence in the correct answer 
by repeating the adiabatic computation algorithm a few times."* By fixing an upper bound Su for the error, i.e., setting S < Su = 
{N + l)'^^^q~^ , we can eliminate q and relate T directly to 

Corollary 2 Under the assumptions of Theorem 1 a final time T which scales as 

r=(J„)-^l7V(iV + l)^|^, (26) 

yields an adiabatic error which satisfies: 

5<5u. (27) 

This corollary demonstrates that the adiabatic time T is insensitive to the adiabatic error (in that it depends only on its A^th 
root). 

Remark 1 The n-dependence of both ^ and the gap d is an important aspect of our result. It will be made explicit in Section 
VB. We also note again why we are interested in the N-dependence ofT and S. The reason is that we view N as a controllable 
parameter, which an experimenter or quantum algorithm designer can vary in order to optimize T and 5 ( see Section VB for 
more details). 

Remark 2 The value ofj (distance of the nearest pole of the Hamiltonian in the complex time plane from the real time axis) 
may, in general, depend on the system size n. However, under certain natural assumptions for AQC which we specify in Section 
VB, 7 is n- independent, so that the entire n-dependence ofT comes from ^ and the gap d. 

Remark 3 Because of our analyticity assumption, it is important that only a finite number N of derivatives of the Hamiltonian 
vanish at the initial and final times, for otherwise Liouville's theorem would imply that H{t) would have to be constant (within 
its analyticity domain). 

Before proving Theorem 1 we briefly discuss resolvents and collect the pertinent results from Ref. [11] in Section III. 



See Ref. [26] for an extensive discussion of related issues in quantum estimation algorithms. 
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III. BACKGROUND 
A. Resolvents 



Definition 2 The (full) resolvent of H is: 



It is defined on the resolvent set 



R{t,z) = [H{t)-z]-\ (28) 



p{H{t)) = C - <j{H{t)). (29) 



The resolvent is an analytic function of z on p{H), and where H{t) is an analytic function of r, i?(r, z) will be an analytic 
function of r as well [27]. By differentiating the identity {H{t) — z)R{t, z) = I, we obtain 

i?(T, z) = -i?(r, z)H{t)R{t, z), (30) 

assuming that z and r are independent. 

Definition 3 The reduced resolvent is a map from the full Hilbert space to the orthogonal complement of the target subspace: 

Grir) = i[H{t) - E{t)];^ ■.Hn^ni= 7^[Px(T)], (31) 
(where TZ denotes the range) and is defined via 

Gr{T)[H{T) - E{t)] = [H{t) - E{T)]Gr{T) = iP^{t). (32) 
An explicit representation can be given in the case of a discrete spectrum [Eq. (6)]: 

Gr{r) - E (33) 

j>0 

where Ajo is the jth energy gap from the target state: 

A,o(T)^i?,(r)-i?(r). (34) 

Also note that: 

Grir)P^{T) = P±{T)Gr{T) = G,(t). (35) 

B. Summary of main results from Ref. [11], and a modification 

The main results we shall need from Ref. [11] are their Eqs. (2.10)-(2.19), which we reproduce here for convenience. First 
is the asymptotic expansion of an approximation to the full solution of the Schrodinger equation, where the zeroth order is the 
target adiabatic eigenstate: 

\^Nir,e)) = e-'/o^^Mrf-V^(|a,(r))+e|^i(r))+e2|^2(r))+... + e^|^Ar(r))+e^+Vi^+i(T))). (36) 

The parameter e is the one defined in Eq. (4). The vector |4'Ar(T, e)) first appeared in Berry's work on the "superadiabatic 
approximation" [28], and is designed to provide a close approximation to the actual time-evolved state, under adiabatic evolution. 
We shall call |^'Ar(T, e)) the "superadiabatic state", but it is important to note that it is not normalized, while |<I>(t)) is. By 
inserting the superadiabatic state into the Schrodinger equation, and by choosing the phase of |$(t)) so that 

($(t)|$(t)) =0, (37) 
the following expressions have been obtained in Ref. [11]: 

= fjir)mr)) + |V^^^(r)), 1<J<N (38) 

li^lir)) = Grir) (/,„i(T)|ci>(r)) +P^(r)|^/_i(r))) ; iV^o^M) ^ 0, (39) 
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where it follows from Eq. (35) that 

($(t)|V|(t)) =0, 1< j<7V+l, (40) 

explaining the "_L" superscript. 

The functions fjir), I < j < N, are determined by integration from their defining equation fj = —{^\ipj-) [Eq. (2.11) in 
Ref. [11]] up to a constant c^: 

Mr) = 1, (41) 
Mr) = - r dT'mT')\dr'i^Hr'))+c, 



dT'{dr'Hr')\ij^{T'))+Cj, (42) 

where the last equality follows from integration by parts and Eq. (40). In Ref. [11] the constants Cj were chosen to be zero, i.e., 
fj (0) = 0, 1 < j < A^. For our purposes they are chosen so that 

= 0, l<j<iV, (43) 



., we choose Cj = dr' {<^>{T')\dr'i'j-{r')) for l<j< N. 
Equation (39) can also be written in either of the following forms: 



H't) = GrP±[f,-i\^) + \i^t-i)) 



Grif,-l\^)+H'U 



= G,|V;^^_i) -/,_iG,,|$). (44) 

The first equality is to emphasize that G,. (and its derivatives) always come with a P± (although Pi is already included in G^). 
The last equality follows from Eq. (74) derived below. 
Further, according to Eq. (2. 19) in Ref. [11] we have: 

\mT,e)) - \^^{T,e))\\ < AN{T)e^+\ (45) 

where 

AN[r) < J II — ||dr . (46) 

The initial condition used in Ref. [11] is |-0(O, e)) — |^'jv(0, e)). Using Eqs. (36) and (38) we thus have |-0(O,e)) = 
Ej^o eVj(0)|$(T)) + Ej^V e^lV'/(0)). We shall show in Section IVB that we can make |V'/(ti)) for 1 < j < + 1 
by setting the first + 1 derivatives of H to zero at ri, and in particular at ti = 0. Thus we have 

TV 

|^(0,e)) =^9|<f(T)), ^ = ^eVj(0), (47) 

j=o 

and since |^(0, e)) and |$(t)) are both normalized states, it follows that the initial state is the target state up to a pure phase 
factor ■& & C, namely |t?| = 1. 



IV. PROOF OF THEOREM 1 

A. Proof strategy: two "adiabatic distances" 

The central quantity of interest in the adiabatic theorem is the distance between the exact solution \iP{t, e)) and the target 
eigenstate |$(r)), at the final time. We define this "adiabatic distance" up to a phase (recall the discussion at the beginning of 
Section II D, where this phase was denoted by x): 

<5^|||^(l,e))-e-i^'^(^')'^->(l))||. (48) 
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Using the triangle inequality we have 

S < S1+S2 (49) 

5i = \mi,e))-\^Nil,e))\\ (50) 

62 = |ie^/o^(^')'^^>Ar(l,e))-|$(l))||. (51) 

Our strategy is to bound S by considering and $2 separately. We shall show that 61 can be bounded by a quantity that decreases 
exponentially in the asymptotic expansion order N. This will require the use of the analyticity of the Hamiltonian (Assumption 
1). And, we shall show that 62 can be made to vanish by imposing that the first + 1 derivatives of the Hamiltonian vanish at 
the final time (Assumption 2). We start with 62 and the initial condition. 

B. Role of boundary conditions on derivatives of tlie Hamiltonian 

We now show that by introducing boundary conditions on the derivatives of H, we can impose that the initial |V'(0)) is the 
eigenstate |$(0)) up to a phase, and that S2 = 0. The technique of imposing boundary conditions on the derivatives of H is 
inspired by the old work of Lennard [30] and Garrido and Sancho [31] (see also Nenciu's somewhat more recent work [32, 33]). 

Lemma 1 IfH'-''^ (ti) = 0/or all 1 < k < N and for some ti e [0, 1] then 

|^/(ri)) =0, je {1,...,7V}, (52) 

The proof is given in Appendix A. Using this Lemma we have: 

Corollary 3 //ijC") (0) = 0/or all 1 < k < N + 1 then the initial condition is |-0(O, e)> = t^|3>(0)), as in Eq. (47). 

Proof. Given above Eq. (47). ■ 

In AQC one envisions initializing the system in the ground state, which represents the input of the computational problem. 
Therefore, the last corollary confirms that we have the proper initial condition for AQC, since Assumption 3 guarantees that the 
target state is non-degenerate, and hence the global phase d does not matter. 

Moreover, we can make the error S2 vanish: 

Corollary 4 //ijC^) (1) = 0/or alll<k<N + l then S2 = 0. 
Proof. It follows from Eqs. (36) and (38) that 

N 

N N+1 

= |<i>(l)) + J2 e'H'til)) (53) 

i=i j=i 

Using Lemma 1 at ri = 1 and recalling that /o(t) = 1 along with the boundary conditions fj{l) ~ for j > 1 [Eq. (43)] we 
thus have 

g./„i£(r')dr'/.|^^(l^,))^|$(l))^ (54) 

from which it follows that i52 = 0. ■ 

C. Operator and state bounds 

In this section we derive bounds (many of which are not particularly tight) on the various operators and states that arise in the 
proof of Theorem 1 . 

From the definition (33) of the reduced resolvent and the definition of the operator norm as the maximum eigenvalue we 
immediately obtain 

"'^•■'-'ll- d..(W.)l,.(W))\W.))) ^'^^°"'"'- 
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Lemma 2 

|$(t)) = lGr{T)H{T)\^{T)), (56) 
E{t) = ($(T)|7j(T)|$(T)), (57) 

E{r) = ($(r)|H(r)|$(T)) + ($(r)|H(r)|$(T)) + ($(T)|/f(T)|$(T)). (58) 
zP^(r)|$(r)) = -Gr{r)[HiT)-EiT))mr))-2Grir)[HiT)-Eir))mr)), 

(59) 

Equation (57) is also known as the Hellmann-Feynman relation. 
Proof. From Eq. (32), we have: 

G,(r)H(r) = lP^iT)^Gr{T)iH{T)-E{T))+E{T)Gr{T) =^ (60) 

G.(r)ij(r)|$(r)) = -*|<i)(r))($(r)|$(r)) - *|$(r))(<i>(r)|$(r)) 

-G,(t) (i7(T) - E{t)) |<1>(t)) + ii;(T)G,(T)|$(T)). (61) 

The second term on the RHS of Eq. (61) vanishes because of the phase condition (37) [11]; the third term vanishes because 
H{t) — E{t) projects onto H^', the last term vanishes by Eq. (35). Thus using Eq. (33) we obtain the following formula for 

|$(r)): 

|$(t)) = iG,(T)7j(T)|$(T)) (62) 

where the last equality holds for the case of a discrete spectrum, i.e., when Eq. (6) applies. 

To obtain the Hellmann-Feynman relation (57), we differentiate the relations ^^(t) = {^{t)\H{t)\^{t)) and ($(t)|$(t)) = 
1. Another differentiation yields Eq. (58). 

To obtain Eq. (59), we differentiate the eigenvalue equation H\^) = E\^) twice. Thus: 

{H - E)\<l>) = -{H - E)\^) - 2{H - E)\^). (64) 
Multiplying from the left by G,. and using Eq. (32), we obtain 

iPi_\<l>) = -Gr{H - E)\<^) - 2Gr{H - E)\^). (65) 



Corollary 5 

ll^ll < A{n)p{n), (66) 

\\Pi_\\ < 2A{n)P{n), (67) 

||Pl|$)|| < 6A{nfl3{nf + 2A{n)Tj{n). (68) 

Proof. Equations (66) and (67) are immediate from Eqs. (19), (55), (56), and ||(E'(r)|| = 1. From the Hellmann-Feynman 
relation, Eq. (57), and the definition of the operator norm, we have: |i?(T)| < ||i7(r)||. Combining this with Eq. (58), we also 
obtain: \E{t)\ < 2||ij(T)|| ||$(t)|| + \\H{t)\\. Inserting these relations into Eq. (56), yields: 



\\P±m\\ < ||G,|| [\\H\\+max\E\ + 2{\\H\\+mi,x\E\)\mj . (69) 

Hence, 

\\P±\'^)\\ < IIG^II (2||H|| +6l|ij||||$||) <6A\n)(3^{n) + 2A{n)r]{n). (70) 

■ 

Remark 4 In Section VB we consider a general family of Hamiltonians relevant for AQC, for which we find that P{n) and rj{n) 
have the same scaling with n [Eqs. ( 125) and ( 126)]. In this case we have the following bound, after we upper bound 2A(n)ri(n) 
by 2A^{n)0^{n) (as is always possible for large enough n): 

\\Pi_m\<%A^{n)p''{n). (71) 
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Corollary 6 

G,(t)|$(t)) = ($(t)|G,,(t) =0, (72) 

mT)\Gr{T) = -($(t)|G,(t), (73) 

G,(r)|$(r)) = -G,(T)|<i>(r)), (74) 

Gr{T)P^{r) = P^{T)Gr{T)+lGr{T)[H{r)-E{T)]Gr{T), (75) 

||G'r(T)Pi(r)|| < 4A\n)p{n). (76) 

Proof. Equation (72) follows from Eq. (35). By differentiating Eq. (72) we immediately obtain Eqs. (73) and (74). From Eq. (32) 
we have 

Gr{T)[H{T) - E{t)] - tP^ir) - Gr{r)[H{T) - E{t)], (77) 
or after multiplying from the right by Gr and using Eq. (32) again: 

G,(T)Fi(T) = P^{T)Gr{T)+tGr{T)[HiT) - £;(t)]G,(t). (78) 

Bounding the norm of GrP± suffices for our analysis, for which Eqs. (55), (57), (66), and (78) yield: 

\\Gr{T)P±{T)\\ < 2A(3 ■ A + A ■ 2p ■ A ^ 4A^{n)/3{n). (79) 

■ 

In the next lemma, and later in Eqs. (97) and (111), is where we use the assumption of analyticity of the Hamiltonian. This 
assumption is crucial for our error bound. The key technical tool is the Cauchy integral formula (which requires analyticity), 
which links the 7Tith derivative of a function to its values in an explicit m-dependent way. 

Lemma 3 Let B{0) ~ 1 and B{k) = k'^, and let D{k) and D{k) be arbitrary functions such that D{k) > D{k) Vfc e N. 
Suppose ip{t) is an analytic vector-valued function in the domain Sy (Fig. 1). If lp{t) satisfies 

MT)\\<D{k)B{k){^-\l^{T)\)-\ (80) 

for some k > 0, then ^p{t) = dip{T)/dT satisfies 

||^(r)|| < D{k)Bik + 1)(7 - |Im(r)|)-('^+i). (81) 

Proof. We reproduce and slightly generalize the proof reported in Ref. [11] (Lemma 3.1). This proof is more general than what 
is required in our case (Assumption 1), and we include it for completeness. Assume that H{t) has poles or branch points at 
{Ti}, and let 

/i = min{|Im(T,)|}. (82) 

i 

We consider a scenario wherein H{t) can be analytically continued to the singularity-free open set (recall Fig. 1) 

S^ = {t: \t\ <ij}U{t: |Im(r)| < /i, Re(T) G [0, 1]} U {r : |r - 1| < (83) 

Define the circle Cr{t) — {r' G : |r — t'| = R{t) > 0}, centered at r and with radius R{t). This circle will shortly 
serve as an integration contour, and to ensure that Gii{t) is always inside the set S^, it suffices to choose max-r'^Ca \^'^{'''')\ ^ 
|lm(r) + R{t) I < fx. Taking, for example, R{t) = ^^—^j^, for some fc > 0, satisfies this requirement. Relative to Assumption 
1, where r = 1 and fJ. = j, all we require is i? < 7. 
The main idea is to use the Cauchy integral formula for the analytic function (p{t) to write 

1 / (p(t') 



where the circle Gfl,(T) has radius i?(T) = '^^t+i'^^ ' F^rr' e Gij,(T), we have |lm(T')| < |lm(T)| +i?(T), hence 7— |lm(T')| > 
^(7- |lm(T)|). Replacing this bound in II ^(r) II < D{k)B{k){-f - |lm(T)|)-'' [Eq. (80)], we obtain 
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Let D{k) be any function that upper-bounds D{k) for all k. Then this gives rise to 




k + 1 



k+l 



The case fc = follows from the same argument by replacing Cii{t) with a(7 — |lm(r)|), for an arbitrary a < 1. This 
results in the bound ||<^(t)|| < D~^{'j — |lm(r)|)^^. In our application of Lemma 3 we use D{k) = C(fc)y4"('=)/3^('=) and 
D{k) = C(A:)A'=('^)/3''('^), where c{k) > a{k) and d{k) > h{k), and where C{k) is given in Eq. (106), a{k) and h{k) are 
the functions defined later in Eq. (103), c{k) and d{k) are the functions defined later in Eq. (104), and A and (3 are defined in 
Eqs. (18) and (19) (both are > 1), respectively. ■ 



D. Bound on 5i_ 



The error term 5i is exactly the one which already appeared in Eq. (46). Our strategy is to bound the integral in Eq. (46) 
by using an inductive approach based on Lemma 3. So far we have only used the differentiability property of H{t), not its 
analyticity. If one wants to assume only that H{t) is + 1-times differentiable, it is still possible to find an upper bound 
for the integral in Eq. (46), of course in terms of A as well as norms of derivatives of H{t) — for an analysis based only on 
differentiability of the Hamiltonian, see, for example, Refs. [9, 12]. 

To apply Lemma 3, we first need to justify why |V'jv+i (''')) analytic in S-y. We first show that by Assumption 1, E{t) and 
|$(r)) are analytic functions inside 5*-^. To do so we recall the following theorem (modified slightly for our purpose here) [34]: 

Theorem 2 (Kato-Rellich theorem) Let Q{t) be a family of bounded operator-valued analytic functions in a region S. Let 
q(Ti) be a nondegene rate eigenvalue of Q(ti) — for our purpose, we take ti G M. Then, for t near ti, there is exactly one point 
q^r) of (j{Q{t)) near qiri) and this point is isolated and nondegenerate. q{T) is an analytic function ofr near ti, and there is 
an analytic eigenvector \q{T)). When Q{t) is self-adjoint for r — ri S R, \q{T)) is also normalizable. 

Corollary 7 |$(t)) and E[t) are analytic inside Sj. 

Theorem 3 (XII.7 [34]) Let Q{t) be a family of bounded operator-valued analytic functions in a region S. Then the resolvent 
R(t, z) of Q, for z G p{Q(t)), is an analytic function ofr in S. 

Corollary 8 G,(t) is an analytic function ofr inside S-y. 

Proof. Let us take an arbitrary z e p{H{t)). Then, by multiplying Eq. (32) by i?(T, z) from the right, we obtain 

Gr{T) [{H{t) -z)- {E{t) - z)] = zP^{t) =^ 



Gr{T) = j-^^P^{T)R{r,z) 



P^ir)Rir,z)-^ 



Note that, since (t(P_i (r)i?(T, z)) { e{t)-z '^ -^('^) ^ ^{H{t)) - {E{t)},z e p{H{t))], the inverse on the RHS exists. 
Moreover, the expression on the RHS is analytic in terms of R{t), which, together with analyticity of P±{t) in r G S-y, implies 
analyticity of Grir) in S-^. ■ 

Corollary 9 |'0jv+i (''")) analytic inside S^. 

Proof. Note that lip^ir)) = Gr(T)|<l>(r)) is analytic, because of analyticity of Grir) and |'I'(t)) in S^, and because differentia- 
tion preserves analyticity. Now, let us assume by induction that li/jj^ir)) is analytic. Analyticity of li^j'-^i (t)) is then immediate 
from Eqs. (38) and (44); 



Mr)) / dr'{^T')\i^fir)) + \i;^ir)) 



l<j<N, 



13 



and the RHS is a product and sum of analytic functions. ■ 

Now we can initiate the application of Lemma 3. We assume that r e M since we are only concerned with real time evolution 
in AQC. For notational simplicity we drop the r-dependence from here on. From Eq. (44) we have: 



1^. 



Nl 



GrP± 



fN^l\^) + \i^h-l) 



We will find ||V']vll by induction. To initialize the induction we use Eqs. (39) and (38): 



(55), (66) 



(55), (66), (67)^,(68), (76) 
(71), (79) 



We now assume by induction that 

UU\<C{N)giN)A'^<^^'>f3'''^^\ 
where a{N), b{N) and C{N) are functions we shall determine, and where 



9{N) = 



N-1 

1 



N-l 



[5(1) ^ !]■ 



(86) 

(87) 
(88) 
(89) 

(90) 



The form of g{N) comes from Lemma 3. ^ Lemma 3 determines that differentiation raises g{N) to g{N + 1). Setting 

D{N) = C(A^)A"(^)/3^(^) and D = C(iV)A=(^)/3''(^) in that Lemma, provided c{N) > a{N) and d{N) > b{N), we obtain 
the bound: 



U'nW < C{N)g{N + l)A<^^(3'^^^l 

We shall determine c{N) and d{N) below. 

Note that the initial conditions determined by Eqs. (87) and (88) are: * 

a(l)-2 6(1) = 1, 
c(l) = 3 rf(l) = 2, 
C(l) = l. 

We now use the induction hypothesis to write 

II^N+ill < C{N + l)g{N + i)A^(N+i) pHN+i) ^ 
while on the other hand we have from Eq. (86) 

Uh+lW = \\Gr[fN\^) + \i'h)]\\ 

<\\Gr\\[\fN\\m + \H'N\\]- 

The inductive proof consists of showing that the bounds (95) and (96) are the same. To do so we first need to bound |/Ar| 



(91) 

(92) 
(93) 
(94) 

(95) 
(96) 



I/a^(^)I = 



($(r')K'^(r'))rfr' 



< 



dr' 



Hr'MNir')) 
<T- sup ||$(T')|||l^^(r' 

T'e[0,r] 

<1-Al3- C(iV).g(Ar)yl"(^)/?''(^\ 



(97) 



^ Note that we have set Im(r) = because we are on the real axis. 

* Note that the g{2) factor present in Eq. (91) (evaluated at A'^ = 1 and absent in Eq. (88)) gives rise to a discrepancy between the two bounds, unless we set 
7 = 1/14. This does in fact not impose a constraint on the family of Hamiltonians our proof applies to (recall Assumption 1), since in the application of 
Cauchy's theorem we are free to choose an arbitrarily small integration contour around the real-time axis. In spite of having thus fixed its value, we continue 
to write 7 rather than 1/14, as there is no fundamental importance to this value; it is merely an outcome of our rather loose bounds, e.g., as in Eq. (71). 
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where in the last step we used the induction hypothesis again, and hence also the analyticity assumption. Using this bound in 
(96) together with (91) we find: 



N+l\ 



< A 



C(Af)(7(iV)A'^(^)+i/3''W+i • A/3 + 1 • C{N)g{N + 1)A'=W/3'^W 
= C{N) [.9(/V)A'^W+3/3''(^)+2 + g{N + i)A^(^)+^(3diN) 



(98) 



In the last inequality we multiplied the second term by {Af3)'', where fc > 1 is a constant, in order to allow for an adjustment to 
fit the initial conditions; see below. In order to complete the inductive proof the two bounds (95) and (98) should agree, and for 
this it is sufficient that their RHSs are equal: 



C{N + l)g{N + i)A'^{N+i)f^biN+i) ^ g(iV)A''W+3^^W+2 + g{N + i)A-(A^)+i+fe/3'i(A^)+fc 

Since A and /? are arbitrary this requires that 

C{N + l)g{N + 1) = C{N) [g{N) + g{N + 1)] , 

while the terms involving A and (3 must have equal powers, which implies: 

a{N + 1) = a{N) + 3 = c{N) + 1 + fc, 
b{N + 1) = b{N) + 2 = d{N) + fc. 

From these last two equations, together with the initial conditions (92) we easily find: 

a(A^)=3A^-l, b{N)^2N-l. 
We also have c{N) ^ 3N + 1 - k and d{N) = 2N + 1 - k. The initial conditions (93) then yield fc = 1, so that: ^ 

c{N) = 3A^, d{N) = 2N. 

Next we need to solve for C{N) from Eq. (100), subject to the initial condition C(l) = 14. We have, using Eq. (90): 

\N-1 \ 



C{N + 1)^C{N) 1 + 7 



(99) 



(100) 



(101) 
(102) 



(103) 



(104) 



(105) 



whose solution is 



cm = n 1 



(106) 



We can upper-bound C{N) as follows: 



N-l 



C{N)<1[ 1 + 7-, 

j=i ^ 



N-l . , N-l . , 



J 



J 



(107) 



^ Another way to understand the need for the adjustment in the last hne of Eq. (98) comes from this example: 

< IIGr-ll [l/l|ll'j>ll + ||Px||||t/.^||] 



To get the last inequality we multiplied the term A^fS^ by {A/3)'' with k = I. This is required in order to obtain a bound involving just a single power of A 
and of /3. Failing to do this allows for the possibility that the two bounds (95) and (98) will not agree. 
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where m = [7] (smallest integer larger than 7). Thus 



Af-l 



CiN) < n ^ = + + (108) 

N N 

< (iV+l)(- + l)...(- + l) 

2 m 

< {N + < {N + iy+\ (109) 
By collecting all our results and inserting them into Eq. (91) we have, so far: 

ii^^ii<(A^+ir+^(^^)''. (110) 



From Eq. (46), we now have: 



AN{r)<r- sup \\i'j,^,is)\\ < (N + 2)''+' / (VV + 1)^3/3^ \ 

0<S<T<1 V 7 / 



(111) 



Reinserting dimensional units (i.e., H = h/J and A = d/ J) we have 

5i ^ 111^(1, e)) - |vl/jv(l,e))|| <AAr(l)e^+i 



where h= 

dr 



Thus picking T as 



T=l{N+l)^, (113) 
7 d'' 



where g > 1 is a "time dilation factor", gives: 

(5 < (TV + 2)''+ig-(^+i) (114) 

which can be made arbitrarily small in the number + 1 of zero derivatives of the Hamiltonian. We have thus proved Theorem 
1 (where iV + 1 is redefined as N). 

V. DISCUSSION 

A. Comparison to the results of Jensen, Ruskai, and Seller [12] 

In Ref. [12], Jansen, Ruskai, and Seller (JRS) proved — using the methodology of Avron et al. [10] rather than the asymptotic 
expansion [11] — a number of adiabatic theorems, all of which made weaker assumptions than ours. In particular, they did not 
assume analyticity. For example, their Theorem 3 can be summarized as follows. Assuming that /i(t) is C^, that \\h\\ and \\h\\ 
are both bounded, and that h{0) = h{l) ~ 0, then provided 

T^qJ^ + 7mx/^li^ j dr, (115) 

the error at the final time can be made arbitrarily small in the "time dilation factor" q > 1: 

6<q-^. (116) 
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Here doir) = JAo(t) is the instantaneous dimensional minimal gap [Eq. (8)], and the parameter TO(r) is the number of distinct 
eigenvalues in the spectrum of h restricted to the target subspace. 

It is interesting to compare this result to our Theorem 1. In our case, by assumption the target subspace is one-dimensional, 
so m = 1 (though this does not appear to be fundamental to our analysis). Furthermore, we have in various places replaced 
integrals over r by the supremum of their integrand. In hght of this, we would have written Eq. (1 15) as 

T^7q^, (117) 

which is indeed very similar to Eq. (22), except that the A^-dependence is now absent. The reason for this is, of course, that JRS 
did not consider the case of N vanishing time derivatives. The fact that our error bound (23) is much tighter than JRS's (116) is 
again due to analyticity, which allowed us to introduce the N parameter into the error bound. 



B. System-size dependent bounds for local Hamiltonians 

Due to the appearance of sup^^jQ ^ II^IP in the expression for the adiabatic time (22), we now present norm bounds for 
physically relevant Hamiltonians, with the purpose of exhibiting the explicit system-size dependence to the extent possible. This 
is particularly relevant for AQC. 

Let us consider the case when /i(r) is an L-local n-body Hamiltonian (L < n). An L-local Hamiltonian contains interaction 
terms involving at most L bodies, for some fixed L. For example, in the 7i-qubit case let {cr} be the operator basis constructed 
from tensor products of the Pauli matrices ax, (Jy, (Jz, and the identity matrix /. Then we can expand the Hamiltonian as: 

H{r)=J2Ur)rT, (118) 

cr 

in which all ^o- are real functions. This expression for H{t) is in fact a very general "interpolation" Hamiltonian, which 
captures many of the examples considered in the AQC literature, such as the common linear interpolation Hamiltonians of the 
type H{t) = (1 - ^{t))Ho + £,{t)Hi, where and Hi are fixed, n-qubit Hamiltonians, and ^(0) 0, C(l) 1 [5, 6, 20-22]. 
It also captures the unitary interpolation H{t) = J7(r)iJ(0)C/t(r) [16], where U{t) is unitary; this can be seen by Taylor 
expansion of U{t). 

For an L-local Hamiltonian, by definition ^o- = whenever the (Hamming) weight of the corresponding cr — the number of 
non-identity terms in the tensor product — is greater than L. The number of independent real parameters is [35] 

^ „■ L<n/2 ^ ( n 



3=0 

In most physically relevant systems L — 2, i.e.. 



In such cases, we have 



Putting together all these elements we find 



Hir)^Y.^,{r) + Y,VjAr)- (120) 

J = l 3<j' 



#(n,2) = (Qn^ - 3n + 2)/2. (121) 



||i/(r)|| <#(n,2).sup|e^(T)|. (122) 

cr 

Therefore, the overall upper bound for the scaling in terms of n is as follows: 

\\H{t)\\ =o(n^- sup le.(T)l) <o(n^- sup \^^{t)\] . (123) 



(T.T 



With a similar analysis, we also obtain 



(124) 
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where {■Y'^^ = ^rr (•)■ Usually, in physical systems of interest in condensed matter or quantum information, there is a predefined 
lattice or graph structure that dictates the spatial configuration of the system. Typically, there are n spins arranged on a ID, 2D, 
or 3D lattice or graph. Increasing n in such systems means adding new particles only to the surface or boundary of the lattice. 
That is, by construction, a new particle cannot occupy a position inside the lattice, unless a structural defect is present. Even if 
there are structural defects in the system, it is plausible to assume that the number of defective sites is very small relative to the 
total number of the particles in the system, or this number may be a constant independent of n. Assuming that the interactions 
are only local (2-local in space and local in time) and short-ranged, therefore, new particles would not change the coupling 
strengths between the particles far enough from the surface. ^ This argument implies that in systems with a time-independent 
^ and non-defective spatial graph/lattice structure, and short-ranged, local interactions, the coupling strengths (or interpolation 
functions) S^cr (t) do not depend on the system size n, for large enough n, and we have: 

\\H{t)\\< sup |^^(r)| .0(^2), (125) 

cr.r 

||ijW(T)|| < sup|d'=)(T)|.0(n2). (126) 

cr,T 

That is, the norm of H{t) and all of its derivatives scale as 0{n^), but with different coefficients. 

A nice feature of the interpolation Hamiltonian (118) is that its analyticity is determined entirely by the scalar interpolation 
functions {C(t(t)}ct- Let j^r denote the height of the analyticity domain of ^ct(I)- Then 

7 = inf7o. (127) 

cr 

is the height of the analyticity domain appearing in Theorem 1 . Clearly, if the interpolation functions (t) do not depend on 
the system size n because the system is confined to a graph with time-independent geometry and topology, nor will 7 (recall 
Remark 2). 

The next issue concerns the scaling of the gap with n. As pointed out, e.g., in Ref. [21], how the gap scales depends on 
whether one is dealing with a first or higher order quantum phase transition (QPT). First order QPTs are typically associated 
with exponentially small gaps, while higher order QPTs are associated with polynomially small gaps (in both cases, as a function 
of n). Consider first the latter, i.e., 

d{n) - Jn-^ (128) 

where z > is the "dynamical critical exponent" [36]. We can now apply these considerations to Theorem 1, and find: 

Corollary 10 Under the same assumptions as in Theorem 1, and assuming a second order quantum phase transition [i.e., a gap 
that scales as in Eq. ( 128)], a time scaling as 

T^-N^ — ^—n"^-^^, (129) 

7 J-* 

yields an adiabatic approximation error which satisfies 5 < {N + ip^^q^^") (independently ofn). 

A different situation arises in the context of the adiabatic version of Grover's problem [6], which is an example of a first order 
QPT. In Ref. [12], condition (1 15) was applied in this setting, where the Hamiltonian has the following form: 



Hg{t) (1 - x{r))il - 10) (01) + x(r)(/ - |m)(m|), (130) 
m G 

dependence on the number of qubits n: 



(where |0) = J2'i=o ^ N) / V^^" and jti G {0, . . . , 2" — 1}) and the time-dependent spectral gap do is found to have the following 



do{n, t) = Jv/2-" + 4(1 - 2-")(2;(r) - 1/2)2 (131) 



^ Note that ^o- (t) may in general depend on n. One can see this through a simple example. Imagine a cyhnder of gaseous particles with short-ranged 
interactions. Any particle will interact with all pai'ticles inside a sphere of radius rjnt — the range of the interaction — around it. If we add new pailicles to 
the cylinder, at some point (i.e., at some n) all the space inside the shell will be occupied (close-packed), hence, the new particles cannot interact with the 
particle in the center. For such particles, the coupling strength of the interaction with the particle in the center is effectively zero. 

' This condition is designed to exclude a folding of the system lattice, such as protein folding in the case of polymers or DNA molecules. 
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The minimum gap is encountered at the critical point, where the gap scales as A(n) = 0(2~"/^). For this problem, condition 
(115) gives T = 0{d^^) for constant error It is important in deriving this result that the function x{t) is smooth (C°°). This 
result is much more appeahng in terms of its n-dependence than the general estimate (115), but it relies on the fact that the 
norm of the Hamiltonian does not scale with n: \\Ha\\ < 1 + 2 sup^ which is not the generic case. In the setting of our 

Theorem 1, it is clear that we would find T ~ 0{d~^) not for constant error, but for an error that can be made arbitrarily small. 
The assumption of a smooth Hamiltonian is of course compatible with our assumption of analyticity. 

Recently, Ref. [23] derived another error estimate for AQC that relies on smooth interpolation and results in the same estimate 
for the running time: T = 0{d^^). To obtain this result Ref. [23] assumed again that the norm of the Hamiltonian is bounded 
above by a constant. Reference [23] also considered the case of a constant gap and highly degenerate first excited state (i.e., 
a Hamiltonian whose norm depends on n), and argued numerically that for a smooth interpolation it is possible to obtain an 
exponential error estimate: S = 0(nexp(— Td)), whence a running time T = 0{d~^ Inn) suffices for arbitrarily small error 
This case, though, is again non-generic. The generic situation is one in which the Hamiltonian couples all the states in the 
spectrum, and the spectral gap closes with n. 

C. AQC related issues 

In this subsection we collect a couple of observations related to the relevance of Theorem 1 to AQC. 

First, as a general rule, an advantage in performing AQC over classical computation can only be guaranteed if one has a 
priori knowledge of the final time T, which is presumably shorter than the time required for the execution of the corresponding 
classical algorithm. Similarly, in our setting, in order to be able to set the final-time derivatives of the Hamiltonian equal to zero, 
one needs to know the final time T. Thus both in the general AQC setting and in our case one would like to know the gap d (as 
well as the other, more easily computable quantities appearing in Theorem 1). While this is sometimes amenable to an analytical 
solution, it is in almost all cases a very difficult problem. Fortunately, in AQC the gap is known exactly if one starts from a 
quantum algorithm given in the circuit model, by mapping this algorithm to the adiabatic model. In such a case the gap is an 
easily computable function of the number of gates [15, 37]. However, this result relies on a physically unreasonable Hamiltonian 
containing 5-body interactions, and recent results which map such Hamiltonians to physically reasonable 2-body interactions 
(e.g., Refs. [17, 18]), use so called "perturbative gadgets", which involve an approximation wherein the exact expression for 
the gap is lost. If one does not know the final time T exactly, one can still attempt to compute an estimate Tg for T, and set 
the final-time derivatives of the Hamiltonian equal to zero at T^. Provided Te > T, stretching the adiabatic evolution in such a 
manner cannot result in a worse error than promised by Theorem 1 for T. 

Second, in the context of AQC one would like to measure the final state, in order to extract the answer to the computation. For 
this reason it makes sense to simply make the Hamiltonian constant for t starting from a value slightly smaller than T — which 
implies that the derivatives of the Hamiltonian at the final time T vanish. This, of course, automatically satisfies the requirement 
of vanishing final-time derivatives. One cannot make the same argument about initialization of the computation, however: 
initialization is a dynamic process (e.g., cooling into the ground state), so that one cannot keep the Hamiltonian constant for all 
t < 0. 



D. The open system case 

Suppose that our quantum system of interest S is coupled to another system B, which acts as an environment or "bath". This 
is the setting of open quantum systems [38]. Together, system and bath are described by the Hamiltonian 

h{t)^hs{t)®lB + hsB + Is®hB, (132) 

where hs, hss, and hs are the system, system-bath, and bath Hamiltonians, respectively, and / is the identity operator. The 
joint Hilbert space is 7i = Tis ® TLb, where Tis and TYb are the system and bath Hilbert spaces. Then hss '■ 'H ^ TL, 
hs : Tis ^ Ti-s, and hs '■ Hb "He- In our analysis above we only considered hs{t), which implements the adiabatic system 
evolution in the present case. We assume that hsB and hs are time-independent Hamiltonians. This is a reasonable physical 
assumption in many cases [38]. 

Coupling of the system to the bath introduces decoherence, and modifies the adiabatic condition relative to the closed system 
case we have discussed thus far [39-42]. We are interested in the adiabatic theorem which describes the system state alone. 
To this end, we need an appropriate distance measure. The trace distance is defined as D[pi,p2] = ~ P2II1, where 

|j A|j 1 = Tr|A|, 1^1 = V A'^A, and is a good distance measure between states (or density matrices) pi and p2 acting on the same 
Hilbert space [14]. A useful fact is that taking the partial trace can only decrease the distance between states [14], i.e., if pi and 
P2 are states in the joint system-bath Hilbert space H, then 



L'[TrBpi,TrBp2] < D[pi,p2], 



(133) 
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where Tr^ is the partial trace operation over the bath Hilbert space: TrB[|s)(s'| (g) \b){b'\] = {b'\b)\s){s'\, for arbitrary states 
|s), |s') £ Hs and |6), |6') G Hb- Inequality (133) can be understood intuitively as a consequence of the fact that by erasing 
information (taking the partial trace) one cannot make states more distinguishable, i.e., their distance cannot increase. 

Consider first the uncoupled setting hsB = 0, which we denote by the superscript 0. The target adiabatic system 
state is = \^{t)){^{t)\. Let p1^{t) = /3s,ad(*) ® PbW denote the "target adiabatic joint state," with p%lt) = 

e~'''^*/5^(0)e*''^*. Let p{Q) denote the initial joint system-bath state. The actual state whose time evolution is generated 
by h{t) [Eq. (132)] is p{t) U {t) p{0)U {1^ , where U{t) = T e-^/o is the propagator of the joint system-bath dynam- 
ics, with T denoting time ordering. The actual time evolved system state is ps{t) = TrBp{t). Using Eq. (133), we have the 
following inequality: 

Ss ^ D[ps{T),p%,^{T)] < D[p{T),pl^{T)] ee Ssb- (134) 

The distance 6s is the distance of interest: it is the distance between the actual system state and target system adiabatic state. 
The last inequality shows that it is upper-bounded by the distance 5sb between two "closed-system" states, where closed refers 
here to the joint system-bath entity. Because of this, we already know the form of the adiabatic theorem for Ssb- This is just 
Theorem 1 again, with h as prescribed in Eq. ( 1 32). It follows from Eq. ( 1 34) that we can use this upper bound on Ss b to bound 
6s as well. To be explicit, let us state the theorem we thus obtain for the open system case: 

Theorem 4 Let d denote the minimum gap of the full Hamiltonian h{t) in Eq. (132). Given assumptions 1-3 on h{t), assuming 
Kb and hsB cire time-independent, and that the first N + 1 derivatives of the Hamiltonian vanish at t = and t ~ l,a final 
time T which scales as 

q sup pro J, ||ft,5|p 

T=-N liiH^L^ [L^ (135) 

7 d-^ 

where q > 1 is a free parameter, yields an adiabatic approximation error which satisfies: 

5s<5sB<{N +lV+^q-'' (136) 

Remark 5 There is an important difference between the closed and open system cases: the minimum gap d in the open system 
case is the gap for the full system-bath Hamiltonian (132), which can be expected to be significantly smaller than for the isolated 
system, since generally, due to its much larger number of degrees of freedom, the bath will introduce many intermediate levels 
inside the gap depicted in Fig. 2 for the isolated system. This means that T can be expected to be very much larger in the open 
system case than for the same system without coupling to a bath. See also Ref. [39] for a different approach leading to the same 
conclusion. 



VI. CONCLUSIONS 



In this work we aimed to bridge a gap between rigorous formulations of the quantum adiabatic theorem, leading to expo- 
nentially tight error estimates, and the field of adiabatic quantum computation (AQC), where knowledge of the way the energy 
gap enters the conditions for the adiabatic approximation is crucial. To this end we have presented a version of the quantum 
adiabatic approximation that is useful for AQC, where there is a single non-degenerate ground state, the number of subsystems n 
is variable, and where the interpolation from the initial to the final Hamiltonian is fully controllable, at least in principle. In this 
case, we have shown that for a total time T scaling as the product of the cube of the inverse gap and the square of the operator 
norm of h, the error in the adiabatic approximation can be made exponentially small. Since our version of the quantum adiabatic 
theorem explicitly accounts for the system size dependence (see, e.g.. Corollary 10), this represents an advance over previous 
adiabatic theorems, where either the approximation error or the system-size dependence is not nailed down. 

Our results imply that as long as our key assumption of analyticity of the interpolation in a domain can be satisfied, along 
with a degree of control that allows setting the initial and final time derivatives of the Hamiltonian equal to zero, then from a 
closed-system perspective AQC has an important fault tolerance advantage over the circuit model of quantum computation [14]. 
Namely, whereas in the circuit model even unitary deviations from a prescribed set of gates can ruin a quantum algorithm, in 
AQC large deviations are permissible, as long as the interpolation ends at the desired final Hamiltonian, whose ground state 
encodes the answer to the computational problem one is trying to solve. Of course, this should not be misinterpreted as a claim 
that AQC is fully fault tolerant. It is well known that AQC is vulnerable to interactions with the environment [40, 44—48], and 
only preliminary steps have been taken towards a theory of fault tolerant AQC in an open systems setting [43, 49]. We have also 
reported a corollary regarding the adiabatic theorem for open quantum systems (Theorem 4), which shows that the time-scale 
for adiabaticity is determined by the gap of the full system-bath Hamiltonian. 

There are indications that the adiabatic approximation fails for Hamiltonians with several independent time-scales [50] (see 
also Ref. [12]). This presents an interesting problem for AQC, even in our setting of a closed system with analytic Hamiltonians. 
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For example, consider a situation where there is some smooth control noise on the interpolation functions Co- ('''), which has an 
independent time-scale. Then merely slowing down the evolution by elongating T will have no impact on this noise, so that 
in its presence the time dilation-based error bound (23) cannot be expected to apply. In other words, noise with an intrinsic 
time scale that cannot be stretched in the sense that makes e in the asymptotic expansion (36) small, generates a violation of the 
assumptions used to derive the adiabatic theorem. Future work on fault-tolerant AQC should address this problem. 
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APPENDIX A: PROOF OF LEMMA 1 

We first need the following Lemma: 

Lemma 4 IfH^''^ (ti) = 0/or all I < k < N and for some n € [0, 1], then 

E'^''\ti) ^ p(''\Ti) = |$('=)(ri)) =0, fee {l,...,iV}. (Al) 

Proof. It is well known that for a closed operator H{t) with E{t) an isolated point of a{H{T)) (i.e., suppose that for some e: 
(j{H) U {E' : \E — E'\ < e} = {E}), the corresponding projection can be written as 

Pir) = T^(f dE'R{T,E'), (A2) 

27ri J\E-E'\=r 

for any r E (0, e) [34], where R{t, E') is the full resolvent of H{t). Since E{t) is the eigenvalue of H{t) associated with the 
eigenstate |$(t)) we have: 

E{t)^Tv[H{t)P{t)]. (A3) 
By differentiating Eqs. (A2) and (A3) and using Eq. (30), we obtain 

P{t) = 5-/ dE'R{T,E')H{T)R{T,E'), (A4) 

27ri J\E-E'\=r 

E{t) = Tv[H{t)P(t)]+Tv[H{t)P{t)]. (A5) 

Thus making H{ti) = implies P{ti) = 0, which in turn, from Eq. (A5), implies E(ti) = as well. Note that P(ti) — 
also implies Pi(ti) = 0. Moreover, Eq. (56) yields: 

|$(ti)) = iG.(ri)i/(ri)|$(ri)) = 0. (A6) 

By simple applications of the Leibniz rule {{X ■ Yf"^ = Y!i=o (^)^^'^ ' Y^''-^\ for any pair of (differentiable) objects), it can 
be seen that these conclusions hold also for higher order derivatives. We show this explicitly for j^C^) We obtain 

|$('=)(ri)) =^Y. In- (A7) 

All terms within the summation include a derivative of H. The highest derivative in the RHS is H'^^^ . This is zero for all k < N. 
That is, |$('')(ti)) =0, ke{l,...,N}. ■ 
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Corollary 11 Under the assumptions of Lemma 4, we have 

H^^^ir,) = =^ Gi'^Hn) =0, ke{l,...,N}. (A8) 

Proof. Using Eqs. (74) and the definition of we have GrP± = + |$) ($|. Thus, using Eqs. (75) and Lemma 4 we find: 

Grin) = -G,(ri)|$(ri))($(ri)| + P^{n)Gr{n) + iGr{n)[H{n) - E{n)]Gr{Ti) = 0. 
From the Leibniz rule we obtain 

4=0 

k-1 



+ J2f^-')p^l^'\r,)Gr^~Hr^) 



4=0 

fe-1 



+^T.(\^)GiHri){[H-E]Grf''~^\r.. (A9) 



1=0 



From Lemma 4, all the terms within the summations vanish for all fc < . ■ 
We are now ready to give the proof of Lemma 1 . 

of Lemma 1. We note that |V']^(t)) = Gr(T)|$(T)) -Gr(r)|<I>(r)), from which by using Corollary 11 we obtain 

|VHn)>= -G,(ri)|$(Ti)) =0, (AlO) 

l^Uri)) = -G.(ti)|$(ti)) - G,(ti)|$(ti)) = 0, (All) 

or in general, using the Leibniz rule: 

5^|^i^(ri)) = - ^ ('^)G('+i)(ri)|$('^-^)(ri)) = ,0 < < iV- 1, (A12) 



1=0 



since each term inside the summation vanishes as long as Gr*^^^ (1) = 0, i.e., i + 1 < iV, or, fc = — 1. 

We now show by induction that if 9^|V'/_i(ti)) = for < fc < - - 1), then d';\i'f{Ti)) = for < k < N - j. 
The calculation above initialized the induction for j = 2. We have by Eq. (44): 

9^|V,^(ri)> = -/,-i(ri)(G,.|<i>))^'^-^f^)/«i(ri)(G,.|<I>' 



1=1 

fe 



+ E (J)G«(n)|^f_r+^^(n)) +G.(n)|^f_t^^(ri)). (A13) 

As long as k < N all the terms within the summations are zero, because each of them contains a derivative of Gr at most up 

to order A^. The first term generates a gI'^'^^^ti). When fc + 1 < A^, this term also vanishes. By assumption, the last term 
vanishes when fc + 1 < A^ — — 1). Since j > 2 this implies that all terms vanish when k < N — j. Overall, we have shown 
that 

a^:|^/'/(Ti)) = 0, for 1 < j < A^, < A; < ^ - j. (A14) 
Immediate corollaries of this result are as follows: 

|^/(ri)) = 0, l<j<A^, (A15) 

/,(ri)|$(ri)), l<j< N, (A16) 

a^7, (ri) ^■^''^ 0, 1 < J < TV, 1 < fc < A^ - J. (A17) 

This concludes the proof. ■ 
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